244 research outputs found

    Expression and Efficient Processing of Fuzzy Queries in a Graph Database Context

    Get PDF
    International audienceGraph databases have aroused a large interest in the last years thanks to their large scope of potential applications (e.g. social networks, biomedical networks, data stemming from the web). In a similar way as what has already been proposed in relational databases, defining a language allowing a flexible querying of graph databases may greatly improve usability of data. This paper focuses on the notion of fuzzy graph database and describes a fuzzy query language that makes it possible to handle such database, which may be fuzzy or not, in a flexible way. This language, called FUDGE, can be used to express preference queries on fuzzy graph databases. The preferences concern i) the content of the vertices of the graph and ii) the structure of the graph. The FUDGE language is implemented in a system, called SUGAR, that we present in this article. We also discuss implementation issues of the FUDGE language in SUGAR

    Fuzzy Quantified Queries to Fuzzy RDF Databases

    Get PDF
    International audienceIn a relational database context, fuzzy quantified queries have been long recognized for their ability to express different types of imprecise and flexible information needs. In this paper, we introduce the notion of fuzzy quantified statements in a (fuzzy) RDF database context. We show how these statements can be defined and implemented in FURQL, which is a fuzzy extension of the SPARQL query language that we previously proposed. Then, we present some experimental results that show the feasibility of this approach

    Expression and Efficient Processing of Fuzzy Queries in a Graph Database Context

    Get PDF
    International audienceGraph databases have aroused a large interest in the last years thanks to their large scope of potential applications (e.g. social networks, biomedical networks, data stemming from the web). In a similar way as what has already been proposed in relational databases, defining a language allowing a flexible querying of graph databases may greatly improve usability of data. This paper focuses on the notion of fuzzy graph database and describes a fuzzy query language that makes it possible to handle such database, which may be fuzzy or not, in a flexible way. This language, called FUDGE, can be used to express preference queries on fuzzy graph databases. The preferences concern i) the content of the vertices of the graph and ii) the structure of the graph. The FUDGE language is implemented in a system, called SUGAR, that we present in this article. We also discuss implementation issues of the FUDGE language in SUGAR

    Fuzzy Query By Example

    Get PDF
    International audienceThis paper describes Fuzzy Query By Example, an approach helping users retrieve data without any prior knowledge of the database schema or any formal querying language. The user is solicited to evaluate, in a binary way, pre-selected items of the database. We provide a characterization-based strategy that identifies the properties shared by the examples (resp. counterexamples) positively (resp. negatively) evaluated by the user. These properties are expressed using linguistic terms from a fuzzy vocabulary to ensure that the user has a good understanding of the inferred query

    On Dissimilarity Measures at the Fuzzy Partition Level

    Get PDF
    International audienceOn the one hand, a user vocabulary is often used by soft-computing-based approaches to generate a linguistic and subjective description of numerical and categorical data. On the other hand, knowledge extraction strategies (as e.g. association rules discovery or clustering) may be applied to help the user understand the inner structure of the data. To apply knowledge extraction techniques on subjective and linguistic rewritings of the data, one first has to address the question of defining a dedicated distance metric. Many knowledge extraction techniques indeed rely on the use of a distance metric, whose properties have a strong impact on the relevance of the extracted knowledge. In this paper , we propose a measure that computes the dissimilarity between two items rewritten according to a user vocabulary

    Processing Fuzzy Relational Queries Using Fuzzy Views

    Get PDF
    International audienceThis paper proposes two original approaches to the processing of fuzzy queries in a relational database context. The general idea is to use views, either materialized or not. In the first case, materialized views are used to store the satisfaction degrees related to user-defined fuzzy predicates, instead of calculating them at runtime by means of user functions embedded in the query (which induces an important overhead). In the second case, abstract views are used to efficiently access the tuples that belong to the α-cut of the query result, by means of a derived Boolean selection condition

    PANDA : Personnaliser les ANomalies Détectées par Apprentissage

    Get PDF
    International audienceThe paper addresses the tasks of anomaly detection and explanation simultaneously, in the human-in-the-loop paradigm integrating the end-user expertise : it first proposes to exploit two complementary data representations to identify anomalies, namely the description induced by the raw features and the description induced by a userdefined vocabulary. These representations respectively lead to identify so-called data-driven and knowledge driven anomalies. The paper then proposes to confront these two sets of instances so as to improve the detection step and to dispose of tools towards anomaly explanations. It distinguishes and discusses three cases, underlining how the two description spaces can benefit from oneanother, in terms of accuracy and interpretability.La personnalisation par prise en compte de l'expertise de l'utilisateur des processus de détection d'anomalies est le sujet principal de cet article. Les anomalies sont identifiées dans deux espaces de représentation des données, l'espace initial composé des valeurs brutes et l'espace induit par un vocabulaire flou défini par l'utilisateur sur ces attributs initiaux. L'application de toute méthode de détection d'anomalies sur ces deux espaces conduit à différencier des anomalies issues des données brutes de celles issues du vocabulaire. Afin à la fois d'améliorer la détection des exceptions et de disposer d'outils pour les expliquer, ces deux ensembles d'anomalies sont confrontés. Trois situations émanant de cette comparaison sont étudiées afin de montrer comment l'utilisation combinée des deux espaces améliore l'efficacité de la procédure de détection d'anomalies ainsi que l'interprétabilité des résultats qu'elle génÚre

    PANDA: Human-in-the-Loop Anomaly Detection and Explanation

    Get PDF
    International audienceThe paper addresses the tasks of anomaly detection and explanation simultaneously, in the human-in-the-loop paradigm integrating the end-user expertise: it first proposes to exploit two complementary data representations to identify anomalies, namely the description induced by the raw features and the description induced by a user-defined vocabulary. These representations respectively lead to identify so-called data-driven and knowledge-driven anomalies. The paper then proposes to confront these two sets of instances so as to improve the detection step and to dispose of tools towards anomaly explanations. It distinguishes and discusses three cases, underlining how the two description spaces can benefit from one another, in terms of accuracy and interpretability

    Massive Data Exploration using Estimated Cardinalities

    Get PDF
    International audienceLinguistic summaries are used in this work to provide personalized exploration functionalities on massive relational data. To ensure a fluid exploration of the data, cardinalities of the data properties described in the summaries are estimated from statistics about the data distribution. The proposed workflow also involves a vocabulary inference mechanism from these statistics and a sampling-based approach to consolidate the estimated cardinalities. The paper shows that soft computing techniques are particularly relevant to build concrete and functional business intelligence solutions

    Exploration de données massives à l'aide d'estimations de cardinalités

    Get PDF
    National audienceThis paper describes FuzViz, a tool to explore interactively massive relational data. FuzViz relies on a method building automatically linguistic summaries, that provide concise and intelligible insights in the data content. It offers an interactive view of these summaries, dynamically recomputed on demand. To ensure a fluid exploration of the data, FuzViz exploits the proposition of a highly efficient method for estimating the cardinality ofthe summary properties, estimated from statistics about the data distribution stored in the relational data base, consolidated by a sampling-based approach. The proposed workflow also involves a vocabulary inference mechanism from these statistics.Cet article présente un outil d'exploration interactive de données massives stockées dans un systÚme de gestion de base de données (SGBD), nommé FuzViz. Il repose sur une méthode de construction automatique de résumés linguistiques, qui fournissent une synthÚse concise et intelligible du contenu des données. Il offre une vue interactive de ces résumés recalculée dynamiquement selon les demandes de l'utilisateur. Pour assurer une exploration fluide des propriétés décrites par ces résumés, FuzViz s'appuie sur la proposition d'une méthode efficace d'estimations de leurs cardinalités, produites à partir des statistiques sur la distribution des données gérées par le SGBD et consolidées par une approche basée sur un échantillonnage. L'outil propose de plus un mécanisme d'inférence de vocabulaire flou à partir de ces statistiques
    • 

    corecore